[Iluvatar GPU] Adapt VL model #4313
base: develop
Conversation
Thanks for your contribution!
Force-pushed from b5fe3b3 to d0a687a.
Pull Request Overview
This PR adapts VL (Vision-Language) models for Iluvatar GPU hardware, implementing platform-specific optimizations and configurations to support multimodal inference on Iluvatar devices.
- Pin paddleformers version to 0.3.0 for compatibility
- Implement Iluvatar-specific attention backend optimizations for VL models (see the platform-gating sketch after this list)
- Add support for text-image processing operations and memory management
- Provide comprehensive documentation with installation and usage examples
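Most of the per-file changes summarized below follow one platform-gating pattern: detect the Iluvatar backend at runtime and fall back to Iluvatar-safe code paths (for example, disabling the fused matmul bias in the resampler and DFNRope modules). The sketch below only illustrates that pattern; the `current_platform.is_iluvatar()` helper and the fused-linear call are assumptions based on how FastDeploy and Paddle expose such checks elsewhere, not code copied from this PR.

```python
# Illustrative sketch of the platform-gating pattern; the platform helper and
# fused op below are assumptions, not code taken verbatim from this PR.
import paddle

from fastdeploy.platforms import current_platform  # assumed helper module

# On Iluvatar the fused matmul+bias kernel is unavailable, so bias addition
# falls back to a separate elementwise add after the matmul.
FUSE_MATMUL_BIAS = not current_platform.is_iluvatar()


def linear_with_optional_fusion(x, weight, bias):
    if FUSE_MATMUL_BIAS:
        # Fused path on platforms that support it (Paddle's incubate fused linear).
        return paddle.incubate.nn.functional.fused_linear(x, weight, bias)
    # Unfused fallback used on Iluvatar.
    return paddle.matmul(x, weight) + bias
```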
Reviewed Changes
Copilot reviewed 12 out of 12 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| requirements_iluvatar.txt | Pin paddleformers to version 0.3.0 for stability |
| fastdeploy/worker/iluvatar_worker.py | Configure Paddle flags for multimodal support |
| fastdeploy/worker/iluvatar_model_runner.py | Add VL model initialization and rope embedding handling |
| fastdeploy/worker/gpu_model_runner.py | Import additional Iluvatar-specific operations |
| fastdeploy/model_executor/models/ernie4_5_vl/modeling_resampler.py | Disable fused matmul bias for Iluvatar platform |
| fastdeploy/model_executor/models/ernie4_5_vl/image_op.py | Add Iluvatar platform support for image operations |
| fastdeploy/model_executor/models/ernie4_5_vl/dfnrope/modeling.py | Disable fused matmul bias for Iluvatar platform |
| fastdeploy/model_executor/layers/rotary_embedding.py | Add Iluvatar-specific rotary embedding handling |
| fastdeploy/model_executor/layers/attention/iluvatar_attn_backend.py | Implement VL-specific attention metadata and tensor handling |
| docs/zh/get_started/installation/iluvatar_gpu.md | Add Chinese documentation for VL model usage |
| docs/get_started/installation/iluvatar_gpu.md | Add English documentation for VL model usage |
| custom_ops/setup_ops.py | Register additional CUDA operations for Iluvatar support (see the sketch after this table) |
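For the custom_ops/setup_ops.py row above, registering the additional CUDA operations comes down to adding the new .cu sources to the list handed to Paddle's extension builder for the Iluvatar build. The sketch below only illustrates that wiring: the variable and extension names are made up, and only the .cu paths come from this PR's diff (shown again in the review snippet further down).

```python
# Sketch of registering the extra CUDA sources for the Iluvatar build.
# Variable/extension names are illustrative; only the .cu paths come from the PR.
from paddle.utils.cpp_extension import CUDAExtension, setup

iluvatar_sources = [
    "gpu_ops/sample_kernels/top_k_renorm_probs.cu",
    # Text-image ops required by the VL (vision-language) pipeline:
    "gpu_ops/text_image_index_out.cu",
    "gpu_ops/text_image_gather_scatter.cu",
    # "gpu_ops/extract_text_token_output.cu" was initially added here as well,
    # but the review below asks for its removal because the op is deprecated.
]

setup(
    name="fastdeploy_ops",  # illustrative extension name
    ext_modules=CUDAExtension(sources=iluvatar_sources),
)
```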
Force-pushed from a58db43 to f2c3ff5.
LGTM
custom_ops/setup_ops.py (Outdated)
"gpu_ops/sample_kernels/top_k_renorm_probs.cu", | ||
"gpu_ops/text_image_index_out.cu", | ||
"gpu_ops/text_image_gather_scatter.cu", | ||
"gpu_ops/extract_text_token_output.cu", |
extract_text_token_output has already been deprecated. Please remove it as part of this PR as well, including the operator implementation, unit tests, and so on.
@yuanlehome Do you mean deleting custom_ops/gpu_ops/extract_text_token_output.cu, test/operators/test_extract_text_token_output.py, and the extract_text_token_output registration in cpp_extentions.cc? Beyond those three places, "gpu_ops/extract_text_token_output.cu" in setup_ops.py is also used by the metax_gpu build, so deleting the .cu implementation would affect that build as well, wouldn't it?
@yuanlehome It has been removed; please check whether the removal is correct.
002cbd8
```
@@ -1,101 +0,0 @@
// Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.
```
Has it been confirmed that this custom op is no longer used anywhere?
Yes, it has been deprecated.
Adapt the VL model on Iluvatar hardware.
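The documentation files added in this PR cover installation and usage on Iluvatar GPUs. For orientation only, a rough offline-inference sketch is shown below; it assumes FastDeploy's `LLM`/`SamplingParams` offline API and a placeholder ERNIE-4.5-VL checkpoint path. The multimodal (image plus text) request format and any Iluvatar-specific launch flags should be taken from docs/get_started/installation/iluvatar_gpu.md rather than from this sketch.

```python
# Rough text-only smoke test on an Iluvatar machine (assumptions: FastDeploy's
# offline LLM/SamplingParams API and a placeholder local checkpoint path; see
# docs/get_started/installation/iluvatar_gpu.md for the authoritative example,
# including how to pass image inputs to the VL model).
from fastdeploy import LLM, SamplingParams

sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

# Hypothetical checkpoint path; replace with the model referenced in the docs.
llm = LLM(model="/path/to/ERNIE-4.5-VL-28B-A3B-Paddle", tensor_parallel_size=1)

outputs = llm.generate(["Describe what a vision-language model does."], sampling_params)
for output in outputs:
    # Print the raw result object; see the docs for accessing the generated text.
    print(output)
```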